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ABSTRACT 

Protein/Peptide microarrays are rapidly gaining momen- 
tum in the diagnosis of cancer. High-density and high- 
throughput peptide arrays are being extensively used to de- 
tect tumor biomarkers, examine kinase activity, identify an- 
tibodies having low serum titers and locate antibody sig- 
natures. Improving the yield of microarray fabrication in- 
volves solving a hard combinatorial optimization problem 
called the Border Length Minimization Problem {BLMP). 
An important question that remained open for the past seven 
years is if the BLMP is tractable or not. We settle this 
open problem by proving that the BLMP is ./VP-hard. We 
also present a hierarchical refinement algorithm which can 
refine any heuristic solution for the BLMP problem. We 
also prove that the TSP+l-threading heuristic is an O(N)- 
approximation. 

The hierarchical refinement solver is available as an open- 
source code at http://launchpad.net/blm-solve 

Categories and Subject Descriptors 

F.2 [Analysis of Algorithms and Problem Complex- 
ity]: [Complexity of proof procedures]; G.2.2 [Graph The- 
ory]: [Graph Algorithms]; F.1.3 [Complexity Measures 
and Classes]: [Reducibility and completeness] 

General Terms 

Border length minimization, Quadratic assignment, Microar- 
ray optimization, Appoximation algorithms, Computational 
biology 

*Graduate student, Computer Science and Engineering De- 
partment 

t D r. Rajasekaran, UTC Chair processor, Computer Science 
and Engineering Department 

■^Graduate student, Computer Science and Engineering De- 
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1. INTRODUCTION 

Cancer diagnosis research has taken a new direction recently 
by adopting peptide microarrays for reliable detection of tu- 
mor biomarkers (Chatterjee, et al., pQ), (Melle, et al., [7]), 
(Welsh, et al., [8]). These high-throughput arrays also find 
application in examining kinase activity, identifying anti- 
body signatures against tumor antigens, etc. High-density 
peptide arrays are currently fabricated using technologies 
such as photolithography or in-situ synthesis based on mi- 
cromirror arrays. The manufacturers of these arrays are 
facing serious fabrication challenges due to unintended illu- 
mination effects such as diffraction and scattering of light. 
These illumination effects can be reduced dramatically by 
selecting a right placement of the peptide probes before fab- 
rication. Finding this placement can be formulated as a 
combinatorial optimization problem, known as the Border 
Length Minimization Problem (BLMP). Hannenhalli, et al. 
first introduced BLMP in 2002 [J. Although the BLMP 
was formulated in the context of DNA microarrays, peptide 
arrays share a similar fabrication technology. 

The BLMP can be stated as follows. Given N 2 strings of 
the same length, how do we place them in a grid of size 
N x N such that the Hamming distance summed over all 
the pairs of neighbors in the grid is minimized? The BLMP 
has received a lot of attention from many researchers. The 
earliest algorithm suggested by Hannenhalli, et al. reduces 
BLMP to TSP (Traveling Salesman Problem) by comput- 
ing a tour of the strings and then threading the tour on the 
grid [1]. Kahng, et al. have proposed several other heuristic 
algorithms which are considered the best performing algo- 
rithms in practice [5]. De Carvalho, et al. introduced a 
quadratic program formulation of the BLMP but unfortu- 
nately the quadratic program is an intractable problem [3]. 
Later, Kundeti and Rajasekaran formulated the problem as 
an integer linear program which performs better than the 
quadratic program in practice [BJ. 

Despite many studies on the BLMP, the question of whether 
BLMP is tractable or not remained open for the past 7 years. 
In this paper, we show that the BLMP is A/'P-hard. We 
also consider a generalization of the BLMP called the Ham- 
ming Graph Placement Minimization Problem (HGPMP). 
We show that some special cases of the HGPMP are also 
ATP-hard. On the algorithmic side, we show that a simple 
version of the algorithm suggested by Hannenhalli, et al. is 
an 0(iV)-approximation. On the practical side, we propose 



a refinement algorithm which takes any solution and tries 
to improve it. An experimental study of this refinement 
algorithm is also included. 

Our paper is organized as follows. Section [2] formally de- 
fines the BLMP and HGPMP. Section [3] provides the MV- 
hardness proof of the BLMP and some special cases of the 
HGPMP. Section[4]gives the 0(TV)-approxfmation algorithm 
and the refinement algorithm for the BLMP. Section [5] pro- 
vides an experimental evaluation of the refinement algo- 
rithm. Finally, Section [6] concludes our paper and discusses 
some open problems. 

2. PROBLEM DEFINITION 

Let S be a set of strings of the same length with S — 
{si, . . . ,s„} and let G — (V,E) be a graph with \V\ = n. 
A placement of S on G is a bijective map / : S — > V. Let 
be the string that is mapped to vertex u by the 
placement /. We denote the Hamming distance between 
two strings Si and Sj as S(si,Sj). The cost of placement / 
is Cost(f) = E^^B'K.r 1 ^),/- 1 ^))- The Hamming 
Graph Placement Minimization Problem (HGPMP) is de- 
fined as follows. Given 5* and G, find a placement of S on G 
of minimum cost. We denote the optimal cost as OPT(S, G), 
or simply as OPT if it is clear what S and G are. 

Obviously, if G is a ring graph, then HGPMP is the same 
as the well-known Hamming Traveling Salesman Problem 
(HTSP). If G is a grid graph of size TV x TV (where TV 2 = 
n), then HGPMP becomes the Border Length Minimization 
Problem (BLMP), which is the main study of our paper. 

3. yVP-HARDNESS OF THE BLMP 

AND HGPMP 

Theorem 1. The BLMP is NT -hard. 

We will show that the Hamming traveling salesperson prob- 
lem (HTSP) for strings (with the Hamming distance metric) 
polynomially reduces to the BLMP. The HTSP is already 
defined in Section [2] 

The idea of the proof is that given 4TV strings for the HTSP 
we construct (TV + l) 2 strings for the BLMP such that from 
an optimal solution to this BLMP, we can easily obtain an 
optimal solution for the HTSP. So we need to consider the 
variant of the HTSP in which the number of strings is divis- 
ible by 4. The proof will be presented in stages. The next 
three subsections present some preliminaries needed for the 
proof of the theorem. Followed by these subsections, the 
proof is presented. 

3.1 4TV-strings traveling salesperson problem 

Define an instance of the HTSP as a 4TV -strings HTSP if 
the number of strings in the input is 4TV (for some integer 
TV). In this section we show that the 4TV-strings HTSP is 
./VP-hard. 

Theorem 2. AN -strings HTSP is MV-hard. 

Proof: We will show that the HTSP polynomially reduces 
to the 4TV-strings HTSP. Let S = {si, s 2 , ■ ■ ■ , s n } be the 



input for any instance of the HTSP. Let £ be the length 
of each input string. Append a string of 2n£ O's to the 
left of each Si to get S; (for 1 < i < (n — 1)). For ex- 
ample, if n = 4, £ — 3 and si = 101, then Si will be 
000000000000000000000000101 . We append 2n£ T's to the 
left of s n to get s' n . We will generate an instance S' of 
the 4TV-strings HTSP that has as input 4TV strings, where 
TV = [~] . S' will have s[, s' 2 , . . . , s' n _ 1 and 1, 2, 3 or 4 copies 
of s'„ depending on whether n — 4TV, 4TV — 1,4TV — 2, or 
4TV — 3, respectively. 

It is easy to see that in an optimal tour for the above 4TV- 
strings HTSP instance, all the copies of s' n will be successive 
and that an optimal solution for S can be obtained readily 
from an optimal solution for S' . □ 

3.2 A special instance of the BLMP 

Consider the following (TV + 1) strings as an input for the 
BLMP: 

ti, t 2 , . . . , t4N, t,t,...,t. Here there are TV 2 — 2TV+1 copies of 
t. There is a positive integer k such that 5(ti, t) = k for any 
1 < i < 4TV and 2k > 5(U,tj) > \k for any 1 < i / j < 4 TV. 

Lemma 1. In any optimal solution to the above BLMP 

instance, ti,t 2 , , Ian will lie on the boundary of the (TV + 

1) x (TV + 1) grid (see Figure^. 

Proof: This can be proven by contradiction. Let T be 
the collection of the strings ti,t 2 ,..., t^M- Let q be one of 
the strings from T that has a degree of 4 in an optimal 
placement. Let r be one of the strings equal to t that lies 
in the boundary. Next we show that we can get a better 
solution by exchanging q and r. 

Let u be the number of neighbors of q from T. Let v be the 
number of neighbors of r from T. Note that < u < 4 and 
< v < 3. In the current solution, the total cost incurred by 
q and r is at least |fcu + fc(4 — u) + kv = jku + kv + Ak. If we 
exchange q and r, the new total cost incurred by q and r is 
strictly less than ku + 2kv + k(3 — v) = ku + kv + 3k. The old 
cost minus the new cost is strictly greater than k — jku > 0. 

We thus conclude that all the strings of T lie on the bound- 
ary of the grid in any optimal solution. □ 

3.3 A special set of strings and some opera- 
tions on strings 

We denote the (ordered) concatenation of two strings x and 
y as x + y. If x and x (respectively y and y') have the same 
length then, clearly, 5(x + y,x' + y') = S(x, x') + S(y, y'). 

Given a string x = x\x 2 . . . xi and an integer h, let REPh(x) 
be the string x±xi . . . x\x 2 x 2 ■ ■ ■ x 2 . . . xiXi . . . xi, where each 
Xi appears h times (REP stands for "replicate"). It is not 
hard to see that if x and y have the same length, then 
6(REP h (x),REP h (y)) = hS(x,y). 

Given an integer n, we can construct a set of n strings of 
length n each, A n ~ {oi, a 2 , . . . , a n }, such that 8(ai, a,) = 2 
for any 1 < i j < n. One way to construct A n is to let 
a t = 00 . . . 0100 . . . 0, where there are (i - 1) 0's before T. It 
is easy to check that S(ai, aj) — 2 for any 1 < i ^ j < n. 



This completes the proof of Theorem \T\ □ 
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3.5 Afp-hardness of the HGPMP for other spe- 
cial cases 

We can generalize the result in Theorem [1] for other special 
cases of the HGPMP. We say graph G is "bordered-ring" if G 
is undirected and G has a ring of size Sl(n a ) for some con- 
stant a > such that every vertex in the ring has degree no 
greater than d and every vertex outside the ring has degree 
greater than d for some d > 3. For grid graphs, a — i and 
d = 3. Some variants of grid graphs like Manhattan grids 
are bordered-ring as well. 



Figure 1: An illustration for Lemma [T] with TV = 4. 
Each U lies on a dark vertex in the grid. 



Theorem 3. The HGPMP is NT -hard even ifG is bordered- 
ring. 



3.4 Proof of the main theorem 

Now we are ready to present the proof of Theorem [1] Let 
S — {si, S2, ■ ■ ■ , S4iv} be the input for any instance of the 
47V-strings HTSP. Each Si has the length I. We will generate 
(7V + 1) 2 strings such that an optimal solution for the BLMP 
on these (TV + l) 2 strings will yield an optimal solution for 
the 47V-strings HTSP on S. 

The input for the BLMP instance that we generate will be 
T = {ti,t 2 , ■ ■ .,t 4 N,t,t, ...,*} where t occurs TV 2 — 2N + 1 
times. We set ti = REPh(ai) + REP^Sj), where at is the 
i-th string in the set A^m defined in subsection 13.31 We will 
choose h later. Also, we set t = REP 4Nh (0) + 0101 . . .01, 
where the string 01 is repeated I times. We can easily check 
that: 

8{ti,t) = h + l for any 1 < i < 47V (1) 
5(U,tj) = 2h + 2S(si, Sj) < 2h + 21 (2) 
for any 1 < i ^ j < AN 

We choose h so that T satisfies the condition in Lemma 
[T] Particularly, choose h = 81. Now we will show that 
OPTblmp{T) = 4(N -l)(h + l) + 8Nh + 20PThtsp{S), 
which in turn means that an optimal solution for the BLMP 
on T will yield an optimal solution for the 47V-strings HTSP 
on S. 

Let A — Si 1 ,Si 2 , . . . , Si iN be an optimal tour for the 47V- 
string HTSP on S. We construct a solution A' for the BLMP 
on T by placing ti's on the border of the grid in the order 
ti ± , ti 2 , . . . , ti 4N and placing the copies of t on the center of 
the grid. By the equalities |T} and ((2j), the cost of A' is 
Cost{A') = 4(7V - l)(h + I) + 8Nh + 2Cost(A). Therefore, 
OPTblmp(T) < 4(7V - l)(h + I) + 8Nh + 20PT H tsp(S). 

On the other hand, let B be an optimal solution for the 
BLMP on T. By Lemma [T] ti 's lie on the border of the 
grid and the copies of t lie on the center of the grid. As- 
sume that ti's lie in the order ti ± , U 2 , ■ ■ ■ , i; 4JV . We can con- 
struct a tour B' for the 47V-strings HTSP on 5* in the order 
Si ± , Si 2 , . . . , s iiN . By the equalities ([TJ and ([2]), Cost(B) = 
4(N -l){h+l) + 8Nh + 2Cost{B'). Hence, OPT BLM p(T) > 
4(7V - l)(7i + I) + 8Nh + 20PT H tsp(S). 



Proof: By a similar reduction to that of the BLMP above, 
the theorem follows. □ 

3.6 An alternate AAp-hardness proof 
for the BLMP 

In this section, we give an alternate A/'P-hardness proof for 
the BLMP by showing that another variant of the HTSP 
called fc-Segments HTSP polynomially reduces to the BLMP. 
We believe that the techniques introduced in both of our 
proofs will find independent applications. 

3.6.1 k-Segments traveling salesperson problem 

We define the fc-segments HTSP and show that it is NP- 
hard. Consider an input of n strings: si, S2, ■ ■ ■ , s n . The 
problem of fc-segments HTSP is to partition the n strings 
into k parts such that the sum of the optimal tour costs for 
the individual parts is minimum. 

Theorem 4. The k-segments HTSP for strings is NT- 
hard. 

Proof: We will prove this for k = 4 (since this is the in- 
stance that will be useful for us to prove the main result) 
and the theorem will then be obvious. 

We will show that the HTSP polynomially reduces to the 
4-segments HTSP. Let 5* = {si, S2, . . . , s n } be the input to 
any instance of the HTSP. We will generate an instance of 
the 4-segments HTSP that has as input (n + 3) strings. Let 
I be the length of each string in S. Note that the optimal 
cost for the HTSP with input 5 is < nl. 



Consider the 4 strings: 1110, 1101, 1011, 0111. The dis- 
tance between any two of them is 2. Now replace each 1 in 
each of these 4 strings with a string of nl l's. Also, replace 
each in each of these strings with a string of nl 0's. Call 
these new strings ti, £2, £3, t4. The distance between any two 
of these strings is 2nl. 

The input strings for the 4-segments HTSP are 31, 92, ... , Qn+3 
and are constructed as follows: q t is nothing but s t with ti 



appended to the left, for 1 < i < n. q n +i is a string of 
length 4nl + I whose I LSBs are O's and whose 4nl MSBs 
equal t^. q-n+2 is a string of length 4nl + I whose I LSBs are 

and whose 4nl MSBs equal t$. Also, q n +3 has all O's in its 

1 LSBs and its 4nl MSBs equal £4. 

Clearly, in an optimal solution for the 4-segments HTSP in- 
stance, the four parts have to be {qi, q2, ■ ■ ■ , q n }, {qn+i}, {q n + 
and {q n +3}- As a result, we can get an optimal solution 
for the HTSP instance given an optimal solution for the 4- 
segments HTSP instance. □ 

3.6.2 A special instance of the BLMP 

Consider the following n 2 strings as an input for the BLMP: 
ti, t2, ■ ■ ■ , t n , t,t, . . . , t. Here there are n 2 — n copies of t. 
Also, 5(U,tj) — 16 for any i and j less than or equal to n. 
5(ti,t) — 9 for any i < n. 

Lemma 2. In an optimal solution to the above BLMP in- 
stance, t\,t2, . . . ,tn lie on the boundary of the n x n grid 
and moreover these strings are found in four segments of 
successive nodes. 

Proof: Let T be the collection of strings ti, t%, . . . , t n . By 
Lemma [T] we conclude that all the strings of T lie on the 
boundary of the grid in an optimal solution. 

Let Si and 5*2 be two segments such that Si and S2 consist of 
strings from T, strings in Si are in successive nodes, strings 
in S2 are in successive nodes, and these two segments are 
not successive. Consider the case when none of these strings 
is in a corner of the grid. Let Si = {ai, as, • • • , a ni } and 
S 2 = {61,62, 6„ 2 }. Let C(Si) = YlHi 1 K a u <H+i) and 
C(S2) = X^i" 1 <^(6ij 61+1). The total cost for these two 
segments is C(Si) + C(S I 2) + 9(ni + n 2 ) + 36. If we join 
these two segments into one, the new cost will be C(Si) + 
C(S 2 ) + 9(ni +n 2 ) + 34. 

Thus it follows that all the strings of T will be on the bound- 
ary and they will be found in successive nodes in any opti- 
mal solution. Also it helps to utilize the corners of the grid 
since each use of a corner will reduce the total cost by 9. 
Therefore in an optimal solution there will be four segments 
such that all the segments are in the boundary of the grid, 
each segment has strings from T in successive nodes, and one 
string of each segment occupies a corner of the grid. In other 
words, an optimal solution for the BLMP instance contains 
an optimal solution for the 4-segments TSP corresponding 
to T. The optimal cost for this BLMP instance is 25n — 28. 
□ 

3. 6. 3 Construction of strings for the above BLMP in- 
stance 

We can construct n 2 strings that have the same properties 
as the ones in the above BLMP instance. 

To begin with, we construct (n + 1) binary strings of length 
n each. The string ti has all l's except in position i, for 
1 < i < n. The position of the LSB of any string is assumed 



to be 1. String t n +i has all l's. Clearly, S(ti,tj) = 2 for any 
i and j less than or equal to n. Also, 8(ti,t n +i) = 1 for any 
1 < i < n. 

Now, in each ti (for 1 < i < (n + 1)) replace every 1 with a 
string of eight l's and replace each with a string of eight 
l's. After this change, 8(ti,tj) — 16 for any 1 < i, j < n and 
}, 5(ti,t n +i) = 8 for any 1 < i < n. 

Finally, append a to the left of each ti (for 1 < i < n) as 
the MSB. Also, append a 1 to the left of t n +i. In this case, 
S(ti,tj) = 16 for any 1 < i,j ' < n and 5(ti,t n +i) = 9 for any 
1 < i < n. 

3.6.4 The alternate proof of the main theorem 

Let S — {si, S2, . . . , s n } be the input for any instance of 
the HTSP. We will generate n 2 strings such that an opti- 
mal solution for the BLMP on these n 2 strings will yield an 
optimal solution for the 4-segments HTSP on S. 

We will use as the basis the (n + 1) strings generated in the 
above section. Recall that these strings ti,t%, ■ ■ ■ , t n +i are of 
length (8n+l) each. Also, 5(U,tj) = 16 for any 1 < i,j < n 
and 8(ti,t„+i) = 9 for any 1 < i < n. 

Replace each in each of the above strings with nl O's and 
replace each 1 in each of these strings with nl l's. Now, 
S(U,tj) = 16rci for any 1 < i,j < n and 5(U,t n +i) = 9nl for 
an y 1 < i < n. Each of these strings is of length (8n + l)nl. 

Replace each in each Si with two O's (for 1 < i < n) 
and replace each 1 in each s; with two l's and let s'i be 
the resultant string. Note that an optimal solution for the 
4-segments HTSP on the revised S will also be an optimal 
solution for the 4-segments HTSP on the old S. If I is the 
length of each string in the old S, then 21 will be the length 
of each revised input string. 

The input for the BLMP instance that we generate will be 
qi , g2 , . . . , q n , t , t , . . . , t where t occurs n 2 — n times. Each of 
these strings will be of length (8n + l)nl + 21. The string qi 
will have s'i in its 21 LSBs and it will have ti in its (8n + l)nl 
MSBs, for 1 < i < n. The string t will have t n +i in its (8n + 
l)nl MSBs. Its 21 LSBs will be 0101 . . .01, i.e., the string 0T 
is repeated I times. Note that S(qi, qj) — 16ni + S(s'i, s'j) for 
an y 1 < i, J ' < n - Also, 5(qi,t) — 9nl + I for any 1 < i < n. 

Note that strings of this BLMP instance are comparable 
to the strings we had for Lemma [2] This is because the 
interstring distances are very nearly in the same ratios for 
the two cases. As a result, using a proof similar to that 
of Lemma [2] we can show that the strings t\,ti, ■ ■ ■ ,t n will 
all lie in the boundary of the grid in an optimal solution 
to the above BLMP. Let T = {ti, t 2 , . . . , t n }- Also, the 
strings of T will be found in four segments such that one 
string of each segment occupies one of the corner nodes of 
the grid. Let Si, S2, S3, and S4 stand for the strings in these 
four segments, respectively. Let Ci, 62,63, and C4 be the 
optimal tour costs for Si,S2,S3, and S4, respectively. 

Let \Si\ = Hi for 1 < i < 4. The total cost (i.e., the border 
length) for the above BLMP solution can be computed as 



(1,1) d.N) 



(N,l) (N,N) 

Figure 2: The thick dark line corresponds to an op- 
timal tour on the input strings 



follows. Consider Si alone. The cost due to this segment 
is Ci + 2(9nl + I) + (m - l)(9nl + I). The cost 2(9nZ + I) 
is due to the two end points of the segment Si. The cost 
(tji — l)(9ni + I) is due to the fact that each string of Si 
(except for the one in a corner of the grid) is a neighbor of a 
t. Upon simplification, the cost for Si is Ci + (rii + l)(9nl+l). 
Summing over all the four segments, the total cost for the 
BLMP solution is ft + C 2 + C* 3 + C 4 + (n + 4) (9nl + 1) . The 
minimum value of this is obtained when Si , S2 , S3 , and S4 
form a solution to the 4-segments HTSP on T. 

Clearly, an optimal solution for the 4-segments HTSP on T 
will also yield an optimal solution for the 4-segments HTSP 
on S. This can be seen as follows. Consider the strings in 
Si and let Qi — a\,a%,.. . ,a? n . be the corresponding input 
strings (of S), for 1 < i < 4. Note that d is nothing but 
(rii — l)(16nZ) plus twice the optimal tour cost for Qi, for 
1 < i < 4. Thus, Ci + C 2 + C 3 + C 4 is equal to in - 4)16nl + 
2(Ci + C' 2 + C3 + C4) where C- is the optimal tour cost for 
Qi, for 1 < i < 4. 

This completes the proof of Theorem [T] □ 

4. ALGORITHMS FOR THE BLMP 
4.1 An o(AO-approximation algorithm 

In this section, we will show that a simple version of the 
algorithm suggested by Hannenhalli, et al. is actually an 
0(7V)-approximation algorithm. This algorithm can be de- 
scribed as follows. Assume that the input is the set of strings 
S = {si, S2, . . . , Sjva}- The algorithm first computes a tour 
T on strings in S. Then it threads the tour T into the 
grid in row-major order (see Figure [2}. The first step can 
be done by calling the |-approximation algorithm for the 
HTSP suggested by [2]. 

Lemma 3. OPThtsp(S) <20PTblmp{S). 

Proof: Let A be an optimal solution for the BLMP on S. 
Consider the path P' drawn as the thick dark line in Figure 
[2 Obviously, Cost(P') < Cost(A) = OPTblmp(S). Let 
s il and Si N2 be the two endpoints of P'. Since the Hamming 



distance satisfies the triangular inequality, 8(si lt Si 2 ) < 
Cost(P'). Consider the tour that starts at s^, traverses 
along the path P' to Si N2 and comes back to . Obviously, 
the cost of the tour is Cost(P') + S(si 1 , Si 2 ) < 2Cost(P') < 
2Cost(A). Hence, OPT H tsp{S) < 20PT B lmp{S). □ 



Theorem 5. The above algorithm yields an 0(N)-approxvmate 
solution. 



Proof: First, we see that Cost(T) < ^OPThtsp(S) < 
2>OPTblmp{S). The first inequality is due to the |-approximation 
for the HTSP. The second inequality is due to Lemma [3] 
Now let us analyze the cost of the solution F produced by 
the algorithm. Consider the path P drawn as the thick dark 
line in Figured Obviously, Cost(P) < Cost(T). Also, the 
total cost of the iV rows in F is no more than Cost(P). 
By the triangle inequality, it is not hard to see that the 
cost of each column in F is no more than Cost(P). There- 
fore, Cost(F) < (N + l)Cost(P) < (N + l)Cost(T) < 
3(N + I)OPTblmp(S) = 0(N)OPTblmp(S). □ 

4.2 A hierarchical refinement algorithm 

Several heuristics such as the Epitaxial growth have been 
proposed to solve the BLMP problem earlier. However most 
of these heuristics do not improve the cost monotonically. 
Local search based algorithms are often employed to solve 
hard combinatorial problems. We now introduce a hierar- 
chical refinement algorithm (HRA). This refinement tech- 
nique can be applied to any heuristic placement to refine 
the cost and get a better placement. Let iV be the num- 
ber of probes in the placement, d a positive integer such 
that d x — N, x > 1 is called the degree of refinement. The 
refinement algorithm starts with a given placement, then it 
divides the placement into s?, s§ . . . s° N ^ d2 sub-problems with 
d 2 probes per sub-problem. Each of these sub-problems 
is solved optimally - an optimal permutation among the 
probes is found. After this every d 2 sub-problems are com- 
bined into a new sub-problem s] = u1j =1 Si d 2 + j, 1 < i < 
N/d 3 . To solve s\ optimally we identify an optimal per- 
mutation among s® d2+ j € s},l < j < d 2 . This process 
continues until we are left with no sub-problems to solve. 
See Figure [31 

We should remark that while solving a sub-problem opti- 
mally, we also consider the cost contributed from the neigh- 
boring sub-problems. This ensures the monotonic improve- 
ment in the placement cost. The refinement algorithm asymp- 
totically runs in 9(d 2 !iV) time. If d = O(l), the refinement 
algorithm runs in linear time. For small values of d, the al- 
gorithm performs well in practice. HRA is a deterministic 
refinement algorithm. We further extend this by introducing 
randomness. The Randomized Hierarchical Refinement Al- 
gorithm (RHRA) is similar to the HRA algorithm. RHRA 
randomly selects a sub-square within the given placement 
and applies the HRA technique to the selected sub-square. 
Similar to local search algorithms, repeating RHRA algo- 
rithm several times improves the placement cost monoton- 
ically. We study the performance of both these algorithms 
in section [5] 
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Figure 3: Illustration of the hierarchical refinement 
algorithm with degree of refinement 3. This shows 
the possible optimal solutions (i.e. permutation 
among sub-problems) at the top-most and penul- 
timate levels 

4.3 Quad epitaxial algorithm 

The epitaxial (EPX) placement suggested in [5] places a 
randomly selected probe at the center of the array, it con- 
tinues placing the probes greedily around the locations adja- 
cent to the placed probes to minimize the cost (i.e. the algo- 
rithm almost spends 0(N 2 ) time to place each probe). The 
epitaxial algorithm gives good results for small arrays but 
for larger arrays the epitaxial algorithm is impractical and 
extremely slow. We propose the Quad Epitaxial (QEPX) 
algorithm as a simple extension to the epitaxial algorithm. 
QEPX yields good performance and is very fast compared 
to the EPX algorithm. The basic idea behind the QEPX 
algorithm is to divide the array into four parts, apply EPX 
algorithm for each of the four parts and finally find an op- 
timal arrangement among the four parts. In section [S] we 
compare the QEPX algorithm with EPX algorithm. 

5. EXPERIMENTAL STUDY 

5.1 Performance of the qepx algorithm 

In this section we compare the performance of QEPX algo- 
rithm introduced earlier. We use randomly generated probe 
arrays of size 32 2 ,64 2 ,128 2 and 256 2 . In all of our experi- 
mental studies we compute a lower bound on the solution 
by picking the smallest 2N(N — 1) edges from the complete 
Hamming distance graph. Column-4(INIT COST) in the 
table [T] indicates the placement cost obtained by placing 
the probes in the row major order as given by the input. 
Column-5(8) indicates the final placement cost obtained by 
the epitaxial (quad) algorithm. As we can see from columns 
7 and 10, the refinement obtained by the QEPX algorithm 
is very close to the EPX algorithm. On the other hand 
QEPX runs 3.6A faster than the EPX algorithm. As we 
can see from table [1] as the chip size increases EPX algo- 
rithm becomes very slow. We ran both EPX and QEPX 
algorithms on a chip size of 243 x 243 with a time limit of 
60 minutes. The QEPX algorithm took around 12 minutes 
to complete and improved the input placement cost by 36%. 
On the other hand the EPX algorithm did not complete 
the placement. From our experiments we conclude that the 
QEPX can provide a good placement which we can use 



as an input for refinement/local search algorithms such as 
RHRA. In the next sub-section we provide our experimental 
study of HRA and RHRA algorithms on various placement 
heuristics. 

5.2 Performance of refinement algorithms 

We have applied our HRA, RHRA refining algorithms on 
the following placement heuristics. 



• (RAND) Random placement: in this placement we 
just use the order in which the probes are provided to 
our algorithm. 

• (SORT) Sort placement: in this placement the input 
probes are sorted lexicographically 

• (SWM) Sliding Window Matching placement is ob- 
tained by running the SWM [S] algorithm with pa- 
rameters (6, 3). 

• (REPX) Row epitaxial placement is obtained by run- 
ning the row-epitaxial algorithm with 3 look-ahead 
rows. 

• (EPX) Epitaxial placement is obtained by running the 
EPX algorithm 

• (QEPX) Quad epitaxial placement obtained by our 
quad-epitaxial algorithm 



The cost of the placement obtained by running the HRA al- 
gorithm exactly once is given in column-5 (HRA). Column- 
6 (RHRA) indicates the placement cost obtained by running 
our randomized refinement algorithm RHRA for 350 itera- 
tions. From table [5] we can see that as initial placement 
moves closer and closer towards the lower bound the refine- 
ment percentage decreases, which is logical. For test cases 
with 729, 6561 (1024, 4096) probes we use a refinement de- 
gree d = 3 (d = 2). Choosing a bigger refinement degree 
gives better refinements, however takes more time. Finally 
we conclude that our refinement algorithms would be very 
useful when applied in conjunction with fast initial place- 
ment heuristics. A fully function program called blm-solve 
implementing all our algorithms can be downloaded from the 
website http : //launchpad . net /blm-solve , the web-site also 
has all the supplementary details used in the our experimen- 
tal study. 

6. CONCLUSIONS 

In this paper we have studied the Border Length Minimiza- 
tion Problem (BLMP) that has numerous applications in 
biology and medicine. We have solved a seven-year old open 
problem in this area by showing that the BLMP is MV- 
hard. Two different proofs have been given and we believe 
that the techniques in these proofs will find independent ap- 
plications. We have also shown that certain generalizations 
of the BLMP are A/'P-hard as well. In addition, we have 
presented a hierarchical refinement algorithm (HRA) for the 
BLMP. Deterministic and randomized versions of this algo- 
rithm can be used to refine the solutions obtained from any 
algorithm for solving the BLMP. Our experimental results 
indicate that indeed HRA can be useful in practice. 



TEST 


PROBES 


LOWER 


INIT 


EPX 


TIME 


REFINED 


QEPX 


TIME 


REFINED 


CASE 




BOUND 


COST 




(sec) 


PRECENT 




(sec) 


PRECENT 


t-0 


1024 


23480 


37192 


27591 


0.60 


25.81% 


28060 


0.42 


24.55% 


t-1 


1024 


23427 


37029 


27472 


0.62 


25.81% 


28151 


0.43 


23.98% 


t-0 


4096 


86818 


151116 


106471 


10.70 


29.54% 


107805 


3.05 


28.66% 


t-1 


4096 


86897 


151176 


106430 


10.37 


29.60% 


107634 


3.23 


28.80% 


t-0 


16384 


322129 


609085 


410301 


180.00 


32.64% 


411746 


43.93 


32.40% 


t-1 


16384 




608928 


409625 


185.88 


32.73% 


410902 


44.70 


32.52% 


t-0 


65536 




2447885 


2447885 




0.00% 


1563369 


765.79 


36.13% 


t-1 


65536 




2427143 


2427143 




0.00% 


1562630 


774.33 


35.62% 



Table 1: Comparison between epitaxial and quad epitaxial 
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1.27(sec) 
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107766 
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5.04(sec) 
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REPTX 


86818 
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116395 
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0.026% 


1.02(sec) 


4096 
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86818 


106471 


106462 


106448 


0.022% 


1.04(sec) 



Table 2: Cost refinement for various placement heuristics by applying HRA (hierarchical refinement algo- 
rithm) and RHRA (randomized hierarchical refinement algorithm) with 350 iterations 



One of the best performing algorithms for the BLMP is the United States of America, 100(6):3410-3415, 2003. 

epitaxial algorithm (EPX). This algorithm takes too much 
time especially when the number of probes is large. In this 
paper we present a variant called the quad-epitaxial algo- 
rithm (QEPX) that is much faster than EPX while yielding 
a solution that is very close to that of EPX in quality. QEPX 
partitions the input into four parts, works on each part sep- 
arately, and finally combines these solutions. This idea can 
be extended further to partition the input into more parts 
and hence this algorithm is ideal for parallelism. 

Some of the open problems are: 1) In this paper we have 
used a simple lower bound on the quality of solution for the 
BLMP. It will be nice to develop tighter lower bounds; 2) 
Develop more efficient algorithms than EPX; and 3) Design 
parallel algorithms for the BLMP. 
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Abstract. Protein/Peptide microarrays are rapidly gaining momentum 
in the diagnosis of cancer. High-density and high-throughput peptide 
arrays are being extensively used to detect tumor biomarkers, examine 
kinase activity, identify antibodies having low serum titers and locate an- 
tibody signatures. Improving the yield of microarray fabrication involves 
solving a hard combinatorial optimization problem called the Border- 
Length Minimization Problem [BLMP). An important question that re- 
mained open for the past seven years is if the BLMP is tractable or not. 
We settle this open problem by proving that the BLMP is A/'P-hard. 
We also present a hierarchical refinement algorithm which can refine 
any heuristic solution for the BLMP problem. We also prove that the 
TSP+l-threading heuristic is an 0(A)-approxfmation. 
The hierarchical refinement solver is available as an open-source code at 
http : / /launchpad. net /blm- solve. 



1 Introduction 

Cancer diagnosis research has taken a new direction recently by adopting peptide 
microarrays for reliable detection of tumor biomarkers (Chatterjee, et al., [?]), 
(Melle, et al., [?]), (Welsh, et al., [?]). These high-throughput arrays also find 
application in examining kinase activity, identifying antibody signatures against 
tumor antigens, etc. High-density peptide arrays are currently fabricated using 
technologies such as photolithography or in-situ synthesis based on micromirror 
arrays. The manufacturers of these arrays are facing serious fabrication chal- 
lenges due to unintended illumination effects such as diffraction and scattering 
of light. These illumination effects can be reduced dramatically by selecting a 
right placement of the peptide probes before fabrication. Finding this placement 
can be formulated as a combinatorial optimization problem, known as the Bor- 
der Length Minimization Problem (BLMP). Hannenhalli, et al. first introduced 
BLMP in 2002 [?]. Although the BLMP was formulated in the context of DNA 
microarrays, peptide arrays share a similar fabrication technology. 

The BLMP can be stated as follows. Given N 2 strings of the same length, 
how do we place them in a grid of size N x N such that the Hamming distance 
summed over all the pairs of neighbors in the grid is minimized? The BLMP 
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has received a lot of attention from many researchers. The earliest algorithm 
suggested by Hannenhalli, et al. reduces BLMP to TSP (Traveling Salesman 
Problem) by computing a tour of the strings and then threading the tour on the 
grid [?]. Kahng, et al. have proposed several other heuristic algorithms which 
are considered the best performing algorithms in practice [?]. De Carvalho, et al. 
introduced a quadratic program formulation of the BLMP but unfortunately the 
quadratic program is an intractable problem [?]. Later, Kundeti and Rajasekaran 
formulated the problem as an integer linear program which performs better than 
the quadratic program in practice [?] . 

Despite many studies on the BLMP, the question of whether BLMP is tractable 
or not remained open for the past 7 years. In this paper, we show that the BLMP 
is TV'P-hard. We also consider a generalization of the BLMP called the Hamming 
Graph Placement Minimization Problem (HGPMP). We show that some special 
cases of the HGPMP are also TV'P-hard. On the algorithmic side, we show that 
a simple version of the algorithm suggested by Hannenhalli, et al. is an O(N)- 
approximation. On the practical side, we propose a refinement algorithm which 
takes any solution and tries to improve it. An experimental study of this refine- 
ment algorithm is also included. 

Our paper is organized as follows. Section 2 formally defines the BLMP and 
HGPMP. Section 3 provides the AA'P-hardness proof of the BLMP and some 
special cases of the HGPMP. Section 4 gives the 0(iV)-approximation algorithm 
and the refinement algorithm for the BLMP. Section 5 provides an experimental 
evaluation of the refinement algorithm. Finally, Section 6 concludes our paper 
and discusses some open problems. 

2 Problem definition 

Let 5 be a set of strings of the same length with S — {s\, . . . ,s n } and let 
G = (V,E) be a graph with \V\ = n. A placement of S on G is a bijective 
map / : S — > V. Let be the string that is mapped to vertex u by the 

placement /. We denote the Hamming distance between two strings Sj and Sj 
as 5(si,Sj). The cost of placement / is Cost(f) = J2 e =(u,v)eE <K/ _1 ( U )> 
The Hamming Graph Placement Minimization Problem (HGPMP) is defined as 
follows. Given S and G, find a placement of S on G of minimum cost. We denote 
the optimal cost as OPT(S, G), or simply as OPT if it is clear what S and G 
are. 

Obviously, if G is a ring graph, then HGPMP is the same as the well-known 
Hamming Traveling Salesman Problem (HTSP). If G is a grid graph of size 
N xN (where N 2 — n) , then HGPMP becomes the Border Length Minimization 
Problem (BLMP), which is the main study of our paper. 

3 AfP-hardness of the BLMP 
and HGPMP 

Theorem 1. The BLMP is MV-hard. 
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We will show that the Hamming traveling salesperson problem (HTSP) for 
strings (with the Hamming distance metric) polynomially reduces to the BLMP. 
The HTSP is already denned in Section 2. 

The idea of the proof is that given AN strings for the HTSP we construct 
(iV + 1) 2 strings for the BLMP such that from an optimal solution to this BLMP, 
we can easily obtain an optimal solution for the HTSP. So we need to consider the 
variant of the HTSP in which the number of strings is divisible by 4. The proof 
will be presented in stages. The next three subsections present some preliminaries 
needed for the proof of the theorem. Followed by these subsections, the proof is 
presented. 

3.1 4iV-strings traveling salesperson problem 

Define an instance of the HTSP as a AN -strings HTSP if the number of strings in 
the input is AN (for some integer N) . In this section we show that the 4A^-strings 
HTSP is ./VP-hard. 

Theorem 2. AN -strings HTSP is MV-hard. 

Proof: We will show that the HTSP polynomially reduces to the 4iV-strings 
HTSP. Let S = {si,s 2 , . . . , s n } be the input for any instance of the HTSP. Let t 
be the length of each input string. Append a string of 2ni O's to the left of each 
Si to get s^ (for 1 < i < (n — 1)). For example, if n = A, I = 3 and si = 101, 
then si will be 000000000000000000000000101. We append 2nl T's to the left 
of s n to get s' n . We will generate an instance S' of the 4A-strings HTSP that 
has as input AN strings, where N — . S' will have s[, s' 2 , . ■ . , s' n _ 1 and 1, 2, 3 
or 4 copies of s' n depending on whether n = AN, AN — 1,47V — 2, or AN — 3, 
respectively. 

It is easy to see that in an optimal tour for the above 4iV-strmgs HTSP 
instance, all the copies of s' n will be successive and that an optimal solution for 
S can be obtained readily from an optimal solution for S'. □ 

3.2 A special instance of the BLMP 

Consider the following (N + l) 2 strings as an input for the BLMP: 
ti, t 2 , ■ ■ ■ , t 4 N, t,t,...,t. Here there are N 2 — 2N + 1 copies of t. There is a pos- 
itive integer k such that 5{U,t) = k for any 1 < i < AN and 2k > 5(ti,tj) > \k 
for any 1 < i ^ j < AN. 

Lemma 1. In any optimal solution to the above BLMP instance, t\,t 2 , ■ ■ ■ ,t 4 N 
will lie on the boundary of the (N + 1) x (N + 1) grid (see Figure 1). 

Proof: This can be proven by contradiction. Let T be the collection of the 
strings t\, t 2 , ■ ■ ■ , t 4 N- Let q be one of the strings from T that has a degree of 4 
in an optimal placement. Let r be one of the strings equal to t that lies in the 
boundary. Next we show that we can get a better solution by exchanging q and 
r. 
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Let u be the number of neighbors of q from T. Let v be the number of 
neighbors of r from T. Note that < u < 4 and < v < 3. In the current 
solution, the total cost incurred by q and r is at least \ku + fc(4 — u) + = 
|fcit + fcu + 4fc. If we exchange q and r, the new total cost incurred by q and r 
is strictly less than ku + 2kv + fc(3 — u) = ku + few + 3fc. The old cost minus the 
new cost is strictly greater than k — jku > 0. 

We thus conclude that all the strings of T lie on the boundary of the grid in 
any optimal solution. □ 



(t) © (f> 



© © ©- 



© © Q> 



Fig. 1. An illustration for Lemma 1 with N = 4. Each lies on a dark vertex in the 
grid. 



3.3 A special set of strings and some operations on strings 

We denote the (ordered) concatenation of two strings x and y as x + y. If x and 
x' (respectively y and y') have the same length then, clearly, S(x + y, x' + y') = 
8{x,x')+5(y,y'). 

Given a string x — x\Xi ■ ■ ■ xi and an integer h, let REPh(x) be the string 
x\Xi . . . X1X2X2 ■ ■ ■ X2 ■ ■ ■ xixi . . . x\ , where each Xj, appears h times (REP stands 
for "replicate" ) . It is not hard to see that if x and y have the same length, then 
5{REP h {x),REP h (y)) = hS(x,y). 

Given an integer n, we can construct a set of n strings of length n each, 
A n — {ai, d2, ■ • • , fl n }, such that 8 [en, aj) = 2 for any 1 < i ^ j < n. One way 
to construct A n is to let a, = 00 . . . 0100 ... 0, where there are (i — 1) 0's before 
1. It is easy to check that 5(a,i, aj) = 2 for any 1 < i ^ j < n. 

3.4 Proof of the main theorem 

Now we are ready to present the proof of Theorem 1. Let S = {si, S2, ■ ■ ■ , s^n} 
be the input for any instance of the 4A r -strings HTSP. Each Sj has the length I. 
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We will generate (N + l) 2 strings such that an optimal solution for the BLMP 
on these (N + l) 2 strings will yield an optimal solution for the 47V-strings HTSP 
on S. 

The input for the BLMP instance that we generate will be 
T = {ti,t2, ■ ■ ■ ,tiN,t,t, . . . ,t} where t occurs TV 2 — 2N + 1 times. We set 
ti = REPh{ai) + REP2(si), where a, is the i-th string in the set A^n defined in 
subsection 3.3. We will choose h later. Also, we set t = REP4Nh(0) + 0101 ... 01, 
where the string 01 is repeated I times. We can easily check that: 



We choose h so that T satisfies the condition in Lemma 1. Particularly, 
choose h = 81. Now we will show that OPT B lmp{T) = A(N — l)(h + l) + 8Nh + 
20PThtsp(S), which in turn means that an optimal solution for the BLMP on 
T will yield an optimal solution for the 4iV-strings HTSP on S. 

Let A = Si, , 

s t2i ■ ■ ■ > s i4.N be an optimal tour for the 4iV-string HTSP on S. 
We construct a solution A 1 for the BLMP on T by placing ij's on the border of 
the grid in the order , U 2 , . . . , ti 4N and placing the copies of t on the center 
of the grid. By the equalities (1) and (2), the cost of A' is Cost(A') = 4(N - 
l)(h + l) + 8Nh + 2Cost(A). Therefore, OPT BLM p{T) < 4(N -l)(h + l) + 8Nh + 
20PThtsp(S). 

On the other hand, let B be an optimal solution for the BLMP on T. By 
Lemma 1, t^s lie on the border of the grid and the copies of t lie on the center 
of the grid. Assume that tj's lie in the order ti 1 ,ti 2 , . . . , i, 4Jv . We can construct 
a tour B' for the 4A^-strings HTSP on S in the order Si 1 , s i2 , . . . , s i4N . By the 
equalities (1) and (2), Cost(B) = 4(N - \){h + I) + 8Nh + 2Cost(B'). Hence, 
OPTblmp (T) > 4(AT - l)(h + l) + 8Nh + 20PT HTSP (S). 

This completes the proof of Theorem 1. □ 

3.5 AfV- hardness of the HGPMP for other special cases 

We can generalize the result in Theorem 1 for other special cases of the HGPMP. 
We say graph G is "bordered-ring" if G is undirected and G has a ring of size 
fl(n a ) for some constant a > such that every vertex in the ring has degree no 
greater than d and every vertex outside the ring has degree greater than d for 
some d > 3. For grid graphs, a = | and d = 3. Some variants of grid graphs like 
Manhattan grids are bordered-ring as well. 

Theorem 3. The HGPMP is NT-hard even if G is bordered-ring. 



S(U, t) = h + l for any 1 < i < AN 
S(ti,tj) = 2h + 2S(si,Sj) < 2h + 2l 
for any 1 < i ^ j < AN 



(1) 
(2) 



Proof: By a similar reduction to that of the BLMP above, the theorem follows. 
□ 
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3.6 An alternate A/^P-hardness proof 
for the BLMP 

In this section, we give an alternate ./VP-hardness proof for the BLMP by show- 
ing that another variant of the HTSP called fc-Segments HTSP polynomially 
reduces to the BLMP. We believe that the techniques introduced in both of our 
proofs will find independent applications. 

fc-Segments traveling salesperson problem 

We define the fc-scgments HTSP and show that it is NP-hard. Consider an input 
of n strings: s\, S2, ■ ■ ■ , s n . The problem of fc-segments HTSP is to partition the n 
strings into k parts such that the sum of the optimal tour costs for the individual 
parts is minimum. 

Theorem 4. The k-segments HTSP for strings is MV-hard. 

Proof: We will prove this for k = 4 (since this is the instance that will be useful 
for us to prove the main result) and the theorem will then be obvious. 

We will show that the HTSP polynomially reduces to the 4-segments HTSP. 
Let S — {s\, S2, ■ ■ ■ , s n } be the input to any instance of the HTSP. We will 
generate an instance of the 4-segments HTSP that has as input (n + 3) strings. 
Let I be the length of each string in S. Note that the optimal cost for the HTSP 
with input S is < nl. 

Consider the 4 strings: 1110, 1101, 1011, 0111. The distance between any 
two of them is 2. Now replace each 1 in each of these 4 strings with a string of 
nl l's. Also, replace each in each of these strings with a string of nl 0's. Call 
these new strings t\,t2,ts,t4. The distance between any two of these strings is 
2nl. 

The input strings for the 4-segmcnts HTSP are qi , (72, ■ • ■ , q n +3 and are con- 
structed as follows: qi is nothing but Si with t\ appended to the left, for 1 < i < n. 
q n +i is a string of length Anl + 1 whose I LSBs are 0's and whose 4nZ MSBs equal 
t2- qn+2 is a string of length 4nl + I whose I LSBs are and whose Anl MSBs 
equal £3. Also, q n +3 has all 0's in its / LSBs and its 4nl MSBs equal £4- 

Clearly, in an optimal solution for the 4-segments HTSP instance, the four 
parts have to be {q\,q2, ■ ■ ■ ,q n }, {^n+i}, {Qn+2}, and {<7„ + 3}. As a result, we 
can get an optimal solution for the HTSP instance given an optimal solution for 
the 4-segments HTSP instance. □ 

A special instance of the BLMP 

Consider the following n 2 strings as an input for the BLMP: 
t\,t2, ■■■ ,t n ,t,t, ... ,t. Here there are n 2 — n copies of t. Also, S(ti,tj) = 16 
for any i and j less than or equal to n. 5(ti,t) = 9 for any i < n. 

Lemma 2. In an optimal solution to the above BLMP instance, t\, <2, . . . , t n lie 
on the boundary of the n x n grid and moreover these strings are found in four 
segments of successive nodes. 
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Proof: Let T be the collection of strings t\, t 2 , . . . , t n . By Lemma 1, we conclude 
that all the strings of T lie on the boundary of the grid in an optimal solution. 

Let Si and S 2 be two segments such that Si and S 2 consist of strings from 
T, strings in Si are in successive nodes, strings in S2 are in successive nodes, 
and these two segments are not successive. Consider the case when none of 
these strings is in a corner of the grid. Let Si = {ai, a 2 , ■ ■ ■ > a ni } and S 2 = 
{bi,b 2 ,...,b n2 }. Let C(Si) = YhLi 6(a»>a»+i) and C(S 2 ) = YJili *>»+i)- 
The total cost for these two segments is C(Si ) + C{S2 )+9(rai+n 2 ) + 36. If we join 
these two segments into one, the new cost will be C(S'i) + C(S'2) + 9(ni + n2) + 34. 

Thus it follows that all the strings of T will be on the boundary and they 
will be found in successive nodes in any optimal solution. Also it helps to utilize 
the corners of the grid since each use of a corner will reduce the total cost by 
9. Therefore in an optimal solution there will be four segments such that all the 
segments are in the boundary of the grid, each segment has strings from T in 
successive nodes, and one string of each segment occupies a corner of the grid. 
In other words, an optimal solution for the BLMP instance contains an optimal 
solution for the 4-segments TSP corresponding to T. The optimal cost for this 
BLMP instance is 25n - 28. □ 

Construction of strings for the above BLMP instance 

We can construct n 2 strings that have the same properties as the ones in the 
above BLMP instance. 

To begin with, we construct (n + 1) binary strings of length n each. The 
string ti has all l's except in position i, for 1 < i < n. The position of the LSB 
of any string is assumed to be 1. String t n +i has all l's. Clearly, 8{U,tj) — 2 for 
any i and j less than or equal to n. Also, S(U, t n +i) = 1 for any 1 < i < n. 

Now, in each ti (for 1 < i < (n + 1)) replace every 1 with a string of eight 
l's and replace each with a string of eight l's. After this change, 5(ti, tj) = 16 
for any 1 < i,j < n and 5{ti,t n +i) = 8 for any 1 < i < n. 

Finally, append a to the left of each U (for 1 < i < n) as the MSB. Also, 
append a 1 to the left of t n+ i. In this case, S(t i} tj) — 16 for any 1 < i,j < n 
and 5(ti,t n+ i) = 9 for any 1 < i < n. 

The alternate proof of the main theorem 

Let S = {si, S2, ■ ■ ■ , s n } be the input for any instance of the HTSP. We will 
generate n 2 strings such that an optimal solution for the BLMP on these n 2 
strings will yield an optimal solution for the 4-segments HTSP on S. 

We will use as the basis the (n + 1) strings generated in the above sec- 
tion. Recall that these strings ti, ■ ■ ■ , t n+ i are of length (8n + 1) each. Also, 
5(ti,tj) = 16 for any 1 < i,j • < n and S(ti, t„+i) = 9 for any 1 < i < n. 

Replace each in each of the above strings with nl O's and replace each 1 in 
each of these strings with nl l's. Now, S(ti,tj) = 16nl for any 1 < i,j < n and 
5(ti, t n +i) — 9nl for any 1 < i < n. Each of these strings is of length (8n + l)nl. 

Replace each in each Si with two O's (for 1 < i < n) and replace each 1 
in each Si with two l's and let s- be the resultant string. Note that an optimal 
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solution for the 4-segments HTSP on the revised S will also be an optimal 
solution for the 4-segments HTSP on the old S. If I is the length of each string 
in the old S, then 21 will be the length of each revised input string. 

The input for the BLMP instance that we generate will be qi, q 2 , . . . , q n , t, t, . . . , t 
where t occurs n 2 —n times. Each of these strings will be of length (8n+l)nl + 2l. 
The string q t will have in its 21 LSBs and it will have U in its (8n + l)nl 
MSBs, for 1 < i < n. The string t will have t n+1 in its (8n + l)nl MSBs. Its 
21 LSBs will be 0101... 01, i.e., the string 01 is repeated I times. Note that 
d(Qi><lj) = 16«Z + 5(s' i7 s'j) for any 1 < i,j < n. Also, 5(qi,t) = 9nl + I for any 
1 < i < n. 

Note that strings of this BLMP instance are comparable to the strings we 
had for Lemma 2. This is because the interstring distances are very nearly in 
the same ratios for the two cases. As a result, using a proof similar to that of 
Lemma 2, we can show that the strings t\,t 2 , . . . ,t n will all lie in the boundary 
of the grid in an optimal solution to the above BLMP. Let T = {t\, t 2 , ■ ■ ■ , t n }. 
Also, the strings of T will be found in four segments such that one string of 
each segment occupies one of the corner nodes of the grid. Let Si, S 2 ,S 3 , and 
S4 stand for the strings in these four segments, respectively. Let Ci, C 2 , C 3 , and 
C 4 be the optimal tour costs for Si, S 2 , S 3 , and S4, respectively. 

Let Si I = rii for 1 < i < 4. The total cost (i.e., the border length) for the 
above BLMP solution can be computed as follows. Consider S\ alone. The cost 
due to this segment is d + 2(9nl + I) + (m - l)(9nZ + I). The cost 2(9nl + I) is 
due to the two end points of the segment Si. The cost (m — l)(9nl + I) is due 
to the fact that each string of Si (except for the one in a corner of the grid) is 
a neighbor of a t. Upon simplification, the cost for Si is C\ + {n\ + l)(9nl + I). 
Summing over all the four segments, the total cost for the BLMP solution is 
C\ + C2 + C3 + C4 + (n + A)(9nl + I). The minimum value of this is obtained 
when Si, S2, S3, and S4 form a solution to the 4-segments HTSP on T. 

Clearly, an optimal solution for the 4-segments HTSP on T will also yield 
an optimal solution for the 4-segments HTSP on S. This can be seen as follows. 
Consider the strings in Si and let Qi = a\ , a 2 , ■ ■ ■ , a l n . be the corresponding input 
strings (of S), for 1 < i < 4. Note that d is nothing but (rii — l)(16nZ) plus 
twice the optimal tour cost for Q i: for 1 < i < 4. Thus, C\ + C 2 + C3 + C\ is 
equal to (n — 4)16nZ + 2(C[ + C' 2 + C3 + C' 4 ) where C[ is the optimal tour cost 
for Qi, for 1 < i < 4. 

This completes the proof of Theorem 1. □ 

4 Algorithms for the BLMP 

4.1 An 0(.ZV)-approximation algorithm 

In this section, we will show that a simple version of the algorithm suggested by 
Hannenhalli, et al. is actually an 0(A)-approximation algorithm. This algorithm 
can be described as follows. Assume that the input is the set of strings S = 
{si, s 2 , . . . , sjv 2 }- The algorithm first computes a tour T on strings in S. Then it 
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threads the tour T into the grid in row-major order (see Figure 2). The first step 
can be done by calling the |-approximation algorithm for the HTSP suggested 

by[?]. 

(U) (im 



(N,l) (N,N) 
Fig. 2. The thick dark line corresponds to an optimal tour on the input strings 



Lemma 3. OPT HTSP (S) < 20PT BLM p(S). 

Proof: Let A be an optimal solution for the BLMP on S. Consider the path 
P' drawn as the thick dark line in Figure 2. Obviously, Cost(P') < Cost (A) = 
OPTblmp(S). Let and s ijv2 be the two endpoints of P' . Since the Hamming 
distance satisfies the triangular inequality, S(s il ,s iN2 ) < Cost(P'). Consider the 
tour that starts at Sj i; traverses along the path P' to Si N2 and comes back to 
Sjj. Obviously, the cost of the tour is Cost(P') + 5(s il ,s iN2 ) < 2Cost(P') < 
2Cost(A). Hence, OPT HTS p(S) < 20PT BL mp(S). □ 

Theorem 5. The above algorithm yields an O(N) -approximate solution. 

Proof: First, we see that Cost(T) < ^OPT HTSP {S) < 30PT BLMP (S). The 
first inequality is due to the |-approximation for the HTSP. The second inequal- 
ity is due to Lemma 3. Now let us analyze the cost of the solution F produced by 
the algorithm. Consider the path P drawn as the thick dark line in Figure 2. Ob- 
viously, Cost(P) < CostiT). Also, the total cost of the N rows in F is no more 
than Cost(P). By the triangle inequality, it is not hard to see that the cost of each 
column in F is no more than Cost(P). Therefore, Cost(F) < (N+l)Cost(P) < 
(N + l)Cost(T) < 3(N + l)OPT BLM p(S) = 0(N)OPT BLM p(S). □ 

4.2 A hierarchical refinement algorithm 

Several heuristics such as the Epitaxial growth have been proposed to solve the 
BLMP problem earlier. However most of these heuristics do not improve the cost 
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monotonically. Local search based algorithms are often employed to solve hard 
combinatorial problems. We now introduce a hierarchical refinement algorithm 
(HRA). This refinement technique can be applied to any heuristic placement 
to refine the cost and get a better placement. Let N be the number of probes 
in the placement, d a positive integer such that d x = N, x > 1 is called the 
degree of refinement. The refinement algorithm starts with a given placement, 
then it divides the placement into sj, s° . . . s^r/d 2 sub-problems with d 2 probes 
per sub-problem. Each of these sub-problems is solved optimally - an optimal 
permutation among the probes is found. After this every d 2 sub-problems are 
combined into a new sub-problem s\ = U^ =1 s^2^ :) , 1 < i < N/d 3 . To solve 
s] optimally we identify an optimal permutation among s° ic p + - £ sj,l < j < 
d 2 . This process continues until we are left with no sub-problems to solve. See 
Figure 3. 

We should remark that while solving a sub-problem optimally, we also con- 
sider the cost contributed from the neighboring sub-problems. This ensures the 
monotonic improvement in the placement cost. The refinement algorithm asymp- 
totically runs in 0(d 2 \N) time. If d = 0(1), the refinement algorithm runs in 
linear time. For small values of d, the algorithm performs well in practice. HRA 
is a deterministic refinement algorithm. We further extend this by introducing 
randomness. The Randomized Hierarchical Refinement Algorithm (RHRA) is 
similar to the HRA algorithm. RHRA randomly selects a sub-square within 
the given placement and applies the HRA technique to the selected sub-square. 
Similar to local search algorithms, repeating RHRA algorithm several times 
improves the placement cost monotonically. We study the performance of both 
these algorithms in section 5. 
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Fig. 3. Illustration of the hierarchical refinement algorithm with degree of refinement 
3. This shows the possible optimal solutions (i.e. permutation among sub-problems) at 
the top-most and penultimate levels 
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4.3 Quad epitaxial algorithm 

The epitaxial (EPX) placement suggested in [?] places a randomly selected 
probe at the center of the array, it continues placing the probes greedily around 
the locations adjacent to the placed probes to minimize the cost (i.e. the algo- 
rithm almost spends 0(N 2 ) time to place each probe). The epitaxial algorithm 
gives good results for small arrays but for larger arrays the epitaxial algorithm 
is impractical and extremely slow. We propose the Quad Epitaxial (QEPX) 
algorithm as a simple extension to the epitaxial algorithm. QEPX yields good 
performance and is very fast compared to the EPX algorithm. The basic idea 
behind the QEPX algorithm is to divide the array into four parts, apply EPX 
algorithm for each of the four parts and finally find an optimal arrangement 
among the four parts. In section 5 we compare the QEPX algorithm with EPX 
algorithm. 

5 Experimental study 

5.1 Performance of the QEPX algorithm 

In this section we compare the performance of QEPX algorithm introduced ear- 
lier. We use randomly generated probe arrays of size 32 2 ,64 2 ,128 2 and 256 2 . In 
all of our experimental studies we compute a lower bound on the solution by pick- 
ing the smallest 2N(N — 1) edges from the complete Hamming distance graph. 
Column-4(INIT COST) in the table 1 indicates the placement cost obtained by 
placing the probes in the row major order as given by the input. Column-5(8) 
indicates the final placement cost obtained by the epitaxial (quad) algorithm. 
As we can see from columns 7 and 10, the refinement obtained by the QEPX 
algorithm is very close to the EPX algorithm. On the other hand QEPX runs 
3.6X faster than the EPX algorithm. As we can see from table 1, as the chip size 
increases EPX algorithm becomes very slow. We ran both EPX and QEPX 
algorithms on a chip size of 243 x 243 with a time limit of 60 minutes. The 
QEPX algorithm took around 12 minutes to complete and improved the input 
placement cost by 36%. On the other hand the EPX algorithm did not complete 
the placement. From our experiments we conclude that the QEPX can provide 
a good placement which we can use as an input for refinement/local search al- 
gorithms such as RHRA. In the next sub-section we provide our experimental 
study of HRA and RHRA algorithms on various placement heuristics. 

5.2 Performance of refinement algorithms 

We have applied our HRA, RHRA refining algorithms on the following place- 
ment heuristics. 

— (RAND) Random placement: in this placement we just use the order in 
which the probes are provided to our algorithm. 

— (SORT) Sort placement: in this placement the input probes are sorted lex- 
icographically 
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TEST 


PROBES 


LOWER 


INIT 


EPX 


TIME 


REFINED 


QEPX 


TIME 


REFINED 


CASE 




BOUND 


COST 




(sec) 


PRECENT 




(sec) 


PRECENT 


t-0 


1024 


23480 


37192 


27591 


0.60 


25.81% 


28060 


0.42 


24.55% 


t-1 


1024 


23427 


37029 


27472 


0.62 


25.81% 


28151 


0.43 


23.98% 


t-0 


4096 


86818 


151116 


106471 


10.70 


29.54% 


107805 


3.05 


28.66% 


t-1 


4096 


86897 


151176 


106430 


10.37 


29.60% 


107634 


3.23 


28.80% 


t-0 


16384 


322129 


609085 


410301 


180.00 


32.64% 


411746 


43.93 


32.40% 


t-1 


16384 




608928 


409625 


185.88 


32.73% 


410902 


44.70 


32.52% 


t-0 


65536 




2447885 


2447885 




0.00% 


1563369 


765.79 


36.13% 


t-1 


65536 




2427143 


2427143 




0.00% 


1562630 


774.33 


35.62% 



Table 1. Comparison between epitaxial and quad epitaxial 



PROBES 


ALGO 


LOWER 


INIT 


HRA 


RHRA 


REFINED 


TIME 






BOUND 


COST 






PRECENT 




729 


RAND 


17087 


26401 


23970 


22631 


14.280% 


2.83(min) 


729 


SORT 


17087 


24082 


22415 


21649 


10.103% 


2.81(min) 


729 


SWM 


17087 


22267 


22195 


22069 


0.889% 


2.81(min) 


729 


REPTX 


17087 


21115 


21107 


21101 


0.066% 


2.81(min) 


729 


EPTX 


17087 


19733 


19726 


19726 


0.035% 


2.81(min) 


6561 


RAND 


136820 


243125 


221090 


209514 


13.825% 


17.55(min) 


6561 


SORT 


136820 


210326 


198972 


191915 


8.754% 


17.02(min) 


6561 


SWM 


136820 


204955 


204525 


203412 


0.753% 


17.20(min) 


6561 


REPTX 


136820 


185386 


185362 


185341 


0.024% 


17.16(min) 


6561 


EPTX 


136820 


168676 


168623 


168544 


0.078% 


17.15(min) 


1024 


RAND 


23480 


37192 


35236 


33046 


11.148% 


0.28(sec) 


1024 


SORT 


23480 


33784 


32326 


31026 


8.164% 


0.26(sec) 


1024 


SWM 


23480 


31424 


31383 


31323 


0.321% 


0.13(sec) 


1024 


QEPX 


23480 


28060 


28035 


28028 


0.114% 


0.47(sec) 


1024 


REPTX 


23480 


29574 


29557 


29546 


0.095% 


O.ll(sec) 


1024 


EPTX 


23480 


27591 


27567 


27565 


0.094% 


O.ll(sec) 


4096 


RAND 


86818 


151116 


143246 


134485 


11.005% 


6.93(sec) 


4096 


SORT 


86818 


131291 


127033 


121742 


7.273% 


4.46(sec) 


4096 


SWM 


86818 


127516 


127357 


127092 


0.333% 


1.27(sec) 


4096 


QEPX 


86818 


107805 


107766 


107702 


0.096% 


5.04(sec) 


4096 


REPTX 


86818 


116406 


116395 


116376 


0.026% 


1.02(sec) 


4096 


EPTX 


86818 


106471 


106462 


106448 


0.022% 


1.04(sec) 



Table 2. Cost refinement for various placement heuristics by applying HRA (hierarchi- 
cal refinement algorithm) and RHRA (randomized hierarchical refinement algorithm) 
with 350 iterations 
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— (SWM) Sliding Window Matching placement is obtained by running the 
SWM [?] algorithm with parameters (6,3). 

— (REPX) Row epitaxial placement is obtained by running the row-epitaxial 
algorithm with 3 look-ahead rows. 

— (EPX) Epitaxial placement is obtained by running the EPX algorithm 

— (QEPX) Quad epitaxial placement obtained by our quad-epitaxial algo- 
rithm 

The cost of the placement obtained by running the HRA algorithm exactly once 
is given in column-5 (HRA). Column-6 (RHRA) indicates the placement cost 
obtained by running our randomized refinement algorithm RHRA for 350 itera- 
tions. From table 2 we can see that as initial placement moves closer and closer 
towards the lower bound the refinement percentage decreases, which is logical. 
For test cases with 729, 6561 (1024, 4096) probes we use a refinement degree 
d = 3 (d = 2). Choosing a bigger refinement degree gives better refinements, how- 
ever takes more time. Finally we conclude that our refinement algorithms would 
be very useful when applied in conjunction with fast initial placement heuristics. 
A fully function program called blm-solve implementing all our algorithms can be 
downloaded from the website http://launchpad.net/blm-solve, the web-site 
also has all the supplementary details used in the our experimental study. 

6 Conclusions 

In this paper we have studied the Border Length Minimization Problem (BLMP) 
that has numerous applications in biology and medicine. We have solved a seven- 
year old open problem in this area by showing that the BLMP is ./VP-hard. 
Two different proofs have been given and we believe that the techniques in 
these proofs will find independent applications. We have also shown that certain 
generalizations of the BLMP are MV-hard as well. In addition, we have presented 
a hierarchical refinement algorithm (HRA) for the BLMP. Deterministic and 
randomized versions of this algorithm can be used to refine the solutions obtained 
from any algorithm for solving the BLMP. Our experimental results indicate that 
indeed HRA can be useful in practice. 

One of the best performing algorithms for the BLMP is the epitaxial algo- 
rithm (EPX). This algorithm takes too much time especially when the number 
of probes is large. In this paper we present a variant called the quad-epitaxial 
algorithm (QEPX) that is much faster than EPX while yielding a solution that 
is very close to that of EPX in quality. QEPX partitions the input into four 
parts, works on each part separately, and finally combines these solutions. This 
idea can be extended further to partition the input into more parts and hence 
this algorithm is ideal for parallelism. 

Some of the open problems are: 1) In this paper we have used a simple lower 
bound on the quality of solution for the BLMP. It will be nice to develop tighter 
lower bounds; 2) Develop more efficient algorithms than EPX; and 3) Design 
parallel algorithms for the BLMP. 
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